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ABSTRACT 

We give a unique classical solution to initial value problem for a system of partial differential 
equations for the densities of components of one dimensional incompressible fluid mixture driven 
by evaporation. 

Motivated by the known fact that the solution appears as an infinite particle limit of stochastic 
ranking processes, which is a simple stochastic model of time evolutions of e.g., Amazon Sales 
Ranks, we collected data from the web and performed statistical fits to our formula. The results 
suggest that the fluid equations and solutions may have an application in the analysis of online 
rankings. 



Keywords: evaporation driven fluid; non-linear wave; stochastic ranking process; long tail; Pareto 
distribution; 

2000 Mathematics Subject Classification: Primary 35C05; Secondary 35Q35, 82C22 
running head: Mixed fluid driven by evaporation and online rankings 



Corresponding author: Tetsuya Hattori, hattoriOmath . toho kuTac . jp| 



Mathematical Institute, Graduate School of Science, Tohoku University Sendai 980-8578, Japan 
tel+FAX: 011-81-22-795-6391 



1 



2 



1 Introduction. 

Let fi > 0, % = 1, 2, • • •, be positive constants, and consider the following system of non-linear partial 
differential equations for the functions Ui(y, t), i = 1, 2, • • •, and v (y, t), defined on (y, t) € [0, l)xl + : 

duj(y,t) d(v(y,t) Ui (y,t)) . 

di 3y = -/i^(2/^)> « = 1,2,---, (1) 

5>i(y,t) = l. (2) 

We consider initial value problems for smooth non-negative initial data 

Ui (y,0)^0, Q^y<l, i = 1, 2, • - - , 
with the boundary conditions at y = and y = 1: 

u(l-0,t)=0, (3) 

«i(0,t) = -|^-, i = l,2,--., (4) 

for t ^ 0, where, for each i, 

-l 

Uj(z,0)dz, (5) 

o 

and we assume 

Note that adding up ([T]) over i and applying ([2]) we have 

9u(l/,t) 



9y 



J2fjUj(y,t), (7) 



which, with ([3]), determines v in terms of Uj . 

Given the positive constants and the initial data Uj(y, 0), the set of equations (|T|) ([2]) ([3j) 
© defines the evolution of our system. The following arguments and results hold both for finite 
components (i = 1, 2, • • • , N) and infinite components. (In fact, we can extend the system and the 
solution to a case with any probability space 0, by replacing m(y,t) with a measure fj,(du,y,t). 
See [3j for probability theoretic arguments.) 

A physical meaning of the system is as follows. We are considering a motion of incompressible 
fluid mixture in an interval with length normalized to 1, where Ui(y,t) is the density of i-th com- 
ponent at space-time point (y,t). ([2]) implies that we normalize Ui so that it represents the ratio 
of i-th component. We are naturally interested in the non-negative solutions Ui(y,t) ^ 0. v(y,t) 
is the velocity field of the fluid. ([T]) is the equation of continuity, and the right-hand-side implies 
that each component evaporates with rate fi per unit time and unit mass. Note that the set of 
equations ([7|) and ([3]) is equivalent to 



Jy 



v (y,t) = ?Jj / Uj(z,t)dz, (8) 
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which implies that the velocity field, or the motion of the fluid, is caused solely by filling the amount 
of fluid which evaporated from the right side of the point y. In particular, we have no flux through 
the boundary y = 1 (v(l — 0, t) = 0). 

The boundary condition (|4|) at y = is so tuned by the initial data that the loss of mass by 
evaporation is compensated by the immediate re-entrance at y = as liquidized particles, so that 
the total mass of each fluid component in the interval [0, 1) is conserved over time: 

l 

Ui(z,t) dz = pi, i ^ 0. (9) 



o 



In fact, ([9]) is equivalent to ((H), under the conditions (P) © © ©, if supj fi < oo. (See Ap- 
pendix [A]) 
Let 



y c (y,t) = / «i(*,0)dz, 0^2/<l, t^O. 



(10) 



For each t ^ 0, yc(',t) '■ [0,1) — > [yc(0, i),l) is a continuous, strictly increasing, onto function of 
y, and its inverse function y{-,t) : [yc(0,t), 1) — > [0, 1) exists: 

l-y = y>~W Uj (z,0)dz, y c (0,t)^y<l, t^0. (11) 



'y(y,t) 

yc{v-,t) denotes the position of a fluid particle at time t (on condition that it does not evaporate up 
to time t) whose initial position is y. y(y, t) denotes the initial position of a fluid particle located 
a t V (= 2/c(0, t)) at time t. 

With slight abuse of notations, we will often write yc{t) for yc(0,t): 

yc(t) = yc(0, t) = l-J2 P^~ f]t - (12) 

3 

yc : [0, oo) — > [0, 1) is a continuous, strictly increasing, onto function of t, and its inverse function 
to : [0, 1) — > [0, oo) exists: 

yc(t (y)) = y, o^ y <i. (13) 

In this paper, we prove the following. 

Theorem 1 There exists a unique classical solution to the initial value problem for the system, of 
partial differential equations defined by (Q])-(EP, which is explicitly given by |2|) and 

e- fMy) fiPi 

y< yc(t), 



Ui(y,t) 



3 

e~ fit Uj(y(y,t) 
J2e-^ Uj (y{y,t),0) 



3 O (14) 

e ht Ui(y(y,t),0) 

~ft~~~ ' / r , y>yc(t), 



3 

Note the unique feature of the solution that for y < yc{t) the solution is stationary: 

^( y ,t) = 0, y<y c (t), (15) 
while initial conditions affect y > yc(t) part only through wave propagation. 
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In natural phenomena where evaporation is active, such as producing salt out of sea water, 
viscosity, surface tension, and external forces such as gravitational forces dominate, and the effect 
of evaporation on the motion of fluid would be relatively too small to observe. Thus the equation 
and the solution we consider in this paper may not have attracted much attention. However, there 
are phenomena on the web for which our formulation may work as a simplified mathematical model, 
such as the time evolutions of rankings of book sales in the online booksellers. Such possibility 
is theoretically based on a result that (|14p appears as an infinite particle limit of the stochastic 
ranking process [4], which is a simple model of the time evolutions of e.g., the number known as the 
Amazon Sales Rank. (We note that this number has mathematically little to do with the perhaps 
more popular notion of Google Page Ranks.) We collected data of the time evolution of the numbers 
from the web, and performed statistical fits of the data to (I12p . Considering the simplicity of our 
model and formula, the fits seem good, which suggest that there is a new application of our results 
in the analysis of online rankings. 

The plan of this paper is as follows. In Section [2] we give a proof of Theorem [TJ In Section [3] 
we give results of fits to (fl~2j) of data from the web. 

The authors would like to thank Prof. T. Miyakawa and Prof. M. Okada for taking interest in 
our work and for inviting the authors to their seminars. The research of K. Hattori is supported 
in part by a Grant-in-Aid for Scientific Research (C) 16540101 from the Ministry of Education, 
Culture, Sports, Science and Technology, and the research of T. Hattori is supported in part by a 
Grant-in-Aid for Scientific Research (B) 17340022 from the Ministry of Education, Culture, Sports, 
Science and Technology. 

2 Proof of the main theorem. 

Put ! 

Ui(y,t)= [ Ui (z,t)dz, i = 1,2, - - - . (16) 
Jy 

With (fl6l) . the the system of equations ([I])-© is equivalent to the following: Ui(y,t) is decreasing 
in y and Ui(l — 0, t) = 0, and 

^(y,t) + v(y,t)^(y,t) = -fiUi(y,t), i = l,2,---, (17) 

J2u j (y,t) = l-y, (18) 

j 

v(y,t) = J2fiUj(y,t), (19) 
j 

for ^ y < 1, t ^ (note ©), and noting ([9]), 

Ui(0,t)= Pi i = 1, 2, • • • , t^0. (20) 
Now, for each S Ho < 1 let vb = VB(yo',t) be a solution to an ODE 
dyB 



dt 

and put 



-(yo;t) = v(y B (yo; t),t), 0, ysiyo; 0) = y , (21) 
<t>i(t) = Ui(y B (yo;t),t). (22) 



With (I2l|) and flUD it follows that 

-jfit) = -Qf(yB(yo;t),t) + v(y B (y ;t),t) -^j-(yB(yo;t),t) = -fiUi(y B (yo;t),t) = -fi4>i{t), 

hence, with 0j(O) = Ui(yo,0), (pi is uniquely solved as 

<k(t) = U l (y ,0)e-^ t . (23) 

With (USD and j2TJ, we then find 



dys 



(yo;t) = ^2fjUj(yo,o) & 



■fit 



dt 

j 



hence, using ([18]) and ([16]) . we have 

ys(yo; i) = 1 - ^(yo, 0) e~ s = l = 1 - ^ / «,■(*, 0) e - ^* = y c (y , t), 

where yc is defined in ([TO]) , With (fTT|) . ([22]) and (f23]) we uniquely obtain 

Ui(y,t) = U i (y(y,t),0)e-* t . 
Differentiating by y and using (|16p and (|24p 

'9VB,~r .x ^V* ~, .x n x -f,t u i (y(y,t),Q)e-f it 



(24) 



5^«i(y(y,t),0) e 



where — — is the derivative of ye = y B (yo',t) with respect to the parameter y$. This proves AT] 
for y > yc(t). 

Next let y < yc(i) and put t\ = t — io(y) G (0,i), where to is as in (fT3"j). Let y^ be a solution 
to an ODE 

yA( s ) = v(yA(s),s), s ^ ti , y A (t 1 )=0, (25) 

and put 

<f>i(s) = Ui{y A (s),s), s^h. (26) 
Note that ([20]) implies 4>i(t{) = C/j(0, ii) = pj, hence, as below ([2T]) . ^ is uniquely solved as 

<lH(8)=p i e-M'- t i\ (27) 

With (HU) and ([25]), we then find 

Note that © and ([3J imply = 1. Hence, 

i 

y A (s) = 1-J2pj e-M'-* 1 ) = y c (s - h), s^h. (28) 

3 
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where yc is defined in (|T2j) . 

Putting s = t in (|26|) and ([271) . using (f28l) . and recalling that ti = i — io(y)> we have, with (fl3j) . 

Di(i/,t) = Ui{y c {t Q {y)),t) = pte-toM. 

Differentiating by y and using ([HID , CG2) an d (fl2]h 



-/i*o(y) ' 

which proves (I14p for y < yc (t) ■ This completes a proof of Theorem [U 

Before closing this section, we give a couple of examples of the solution to the equation of 
motion. 

Example 1 (One particle type). For a pure fluid, p\ = 1, hence we have u\{y,t) = 1. Since 
there is only one type of incompressible fluid, the density is constant. However, there is a flow 
driven by evaporation even in this case, and we actually have yc(t) = 1 — eT^ 1 ■ 

Example 2 (Two particle types with fa = 0). Consider a mixture of 2 components with the 
ratios satisfying p\ > and p2 = 1 — pi > 0. fa = means no evaporation, so the situation is a 
salty sea with salt density p2, where evaporation and flow from a river balance. In Theorem Q] we 
assumed fa > 0, but we can calculate explicitly for fa ^ 0. We have 

y c (t) = pi(l - e-^), t (y) = ~ log(l - ?L). 

h Pi 

The expressions of Ui(y,t) for y < yc(t) become simple: 

v(y,t) = fi(pi-y), ui(y,i) = 1, u 2 (y,i)=0. 

The pure water from the river comes in up to y < yc(t)- (Note that we are considering a fictitious 1- 
dimensional case where no spacial mixing such as turbulence occurs and we have no other dynamics 
such as diffusion.) 

If, furthermore, the initial distribution is uniform on [0,1): Ui(y,0) = pi, i = 1,2, then the 
expressions for y > yc(t) are also simple: 



y c (y, t) = 1 - (1 - y){pie- ht + pa), y(y, t) = 1 



pie + p 2 ' 
and consequently, 

P\G~ p\C~^ 1 ^ P2 

v{y,t) = (l-y)h — ttvt^ — ' = — txt" — , u 2 (y,t) = — —r^ — , 

pie Jlt + p2 pie Jlt + P2 pie + p 2 

for y > yc(t). (In general, the formulas are dependent on initial data in complex ways, and we do 
not have explicit formula.) 



7 



3 Possible application to rankings on the webs. 
3.1 Pareto distribution. 

An idea that the time evolution of ranking numbers such as in Amazon.co.jp sales rank may be 
explained by evaporation-driven fluid motion, is theoretically based on a result [1] which says that 
(|14p appears as a time evolution of empirical distributions of the stochastic ranking processes in 
an infinite particle limit. Roughly speaking, the stochastic ranking process is a simple model of 
the time evolution of a list of rankings, such as a ranking of book sales in an online bookseller, or 
a table of page titles in a collected web bulletin board. A particle (a book, in the case of online 
booksellers) in the ranking jumps randomly to the rank 1 (each time the book is sold), and increases 
the ranking number by 1 each time some other particle of larger ranking number jumps to rank 
1. Because an increase in ranking number is a result of jumps of very large number of particles in 
the tail side of the ranking, the particle effectively moves on the ranking queue in a deterministic 
way, even though each jump occurs at a random time. The jump corresponds to the evaporation 
in the fluid model. We can therefore predict the time evolutions of rankings appearing on the web 
rankings based on our model. See [I] for details of the stochastic ranking processes. 

Here we try to see how a trajectory of a particle (|12p could be observed in the web rankings. 
In applying (|12f) to the actual rankings, we need to choose the distribution of evaporation rates 
{{hi Pi) I i = 1)2, ■■■,N}. In the case of social or economic studies such as online booksellers, 
this corresponds to choosing the distribution of activities or transactions. In the case of online 
booksellers, we have to chose the distribution of sales rates over books. The Pareto distribution (also 
called log-linear distribution in social studies, or power-law in physics literatures) is traditionally 
used as a basic model distribution for various social rankings, perhaps a most well-known example 
is the ranking of incomes. Let N be the total size of population, and for i = 1, 2, • • • , N, denote by 
fi the income of the i-th wealthiest person. If 

/N\ l/b 1 

fi = a [-) > P i = N' i = l,2,3,...,J\r, (29) 

holds for some positive constants a and b, then the distribution of incomes is said to satisfy the 
Pareto distribution. (The Pareto distribution assumes all the constituents to have distinct fi , 
which leads to equal weight pi = 1/N in our notation.) The constant a corresponds to the smallest 
income, and the exponent b reflects a social equality of incomes: in fact the ratio of the largest 
income to the smallest is /i//jv = N 1 ^, which is close to 1 if b is large (a fair society), while is 
large (society is in monopoly) if b is small. (Our b corresponds to a in a standard textbook on 
statistics, 9 in [3], and —I/P2 m [2]) 

Substituting ([29]) in (fTZj) of Section [21 and approximating the summation by integration, we 
have, after a change of variable, 

y C (t) = 1 - b(at) b T(-b, at) + 0(N~ l ), (30) 

/•oo 

where T(z,p) = / e~ w w z ~ 1 dw is the incomplete Gamma function. yc(t) is a relative ranking 
Jp 

normalized by N, so the time evolution of ranking xc(t) is 

x c {t) = l + Ny c {t). (31) 
The 0(N^ 1 ) contribution in (j30|) is (by a careful calculation) seen to be non-negative and bounded 
by J^ e ~ at = leading to a difference of at most 1 in the ranking xc{t), which is insignificant for 
our applications below, so we will ignore it. 



8 



Note that T(—b, at) — > oo as t — ► 0, for b > 0. This divergence is harmless because it is cancelled 
by t b in (|30f) . but for numerical and asymptotic analysis, it is better to perform a partial integration 
on the right-hand side to find 

y c (t) = 1 - e~ at + (at) b T(l - b, at), (32) 

which, with (|31j) . leads to 

x c (t) = N(1- e~ at + (at) b T(l - b, at)) + 1. (33) 

The constant a, which denotes the lowest income in the Pareto distribution, has a role of a time 
constant in ()33[) . In particular, the short time behavior of xc(t) for < b < 1 is 

x c (t) = ct b + 0(t), (34) 

where 

c = Na b T(l-b). (35) 
For 1 < b < 2 we need a partial integration once more for a better expression; 

y c {t) = 1 - e-*(l - ^) - ^ r(2 - 6, at) = ^ t - T(2 - b) t fc + 0(t 2 ). (36) 

Note that for < 6 < 1 the leading short time behavior is yc(t) = 0(t b ), which is tangential to 
the y axis at t = 0, while for b > 1 (the case 6^2 can be handled similarly) the linear dependence 
Vc(t) = 0(t) is dominant for small t. 

3.2 2ch.net bulletin board thread index listings. 

2ch.net is one of the largest collected web bulletin boards in Japan. Each category ('board') has 
an index listing of the titles of 'threads' or the web pages in the board. The titles are ordered by 
"the last written thread at the top" principle; if one writes an article ('response') to a thread, the 
title of that thread in the index listing jumps to the top instantaneously, and the titles of other 
threads which were originally nearer to the top are pushed down by 1 in the listing accordingly. 
We can extract the exact time that a thread jumped to rank 1, because the time of each response 
in a thread is recorded together with the response itself. All these features of the 2ch.net index 
listing match the definition of the stochastic ranking process in [3] . 

As a first attempt to apply our theoretical results to online data, we collected data of the time 
evolution of the index listing and performed statistical fits of the data to (|12p . The actual properties 
of jump rates would be more involved than the models; for example, the distribution of jump rates 
(namely, the distribution of the frequencies of responses of the threads) would be more complex 
than the Pareto distribution, and looking into actual data would provide a test to the applicability 
of our simple model. 

We note that we can use deterministic (non-stochastic) formula such as (|33|) if N, the total 
number of threads in the board is large. When we are keeping track of the ranking of a thread, 
it goes down (number increases) when and only when other threads at the tail side of the ranking 
jumps to the top (i.e., someone writes a response to one of these other threads), and though each 
jump occurs randomly, since there are O(N) threads on the tail side of the thread in question 
(unless it is extremely near the tail), a law-of-large-numbers like mechanism works (as rigorously 
proved in [1]), and the time evolution of the ranking for each thread becomes deterministic as in 
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(|33p . In the case of 2ch.net, N is roughly about 700 to 800, so we would expect fluctuation of a 
few percent, and up to that accuracy, we expect a time evolution predicted by (|33|) . 

We also note that the time evolutions are independent of which thread one is looking at, 
because the changes in the ranking are caused by the collective motion of the threads towards the 
tail; popular threads jump back to the top ranking more frequently than the less popular ones, 
but as long as the threads remain in the queue (i.e., before the next jump), both a popular thread 
and an unpopular thread should behave in the same way, depending only on their position in the 
ranking. 

ranking 




I '- ^ V > tn 

12:00 24:00 



Fig 1: Record of ranking changes in an afternoon for 12 threads in a board of 2ch.net. Points from 
a thread are joined by line segments to guide the eye. 

Fig. [T]is a plot of the threads in a board which jumped to the rank 1 during active hours one day 
and stayed in the queue without jumps until midnight. There were 12 such threads. The ranking is 
obviously monotone function of time between jumps, and there are no overtaking, so that the lines 
do not cross in the figure. Fig. [2] is a plot of same data as in Fig. [TJ except that, for the horizontal 
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Fig 2: Collection of 12 threads in a board of 2ch.net, same as in Fig. [TJ For each thread, time is 
shifted so that the rank of the thread is 1 at time 0. The curve is xc(t) of (I33p with the best fit 
a = a* and b = b* to the data. Horizontal and vertical axes are the hours and ranking, respectively. 

axis the time is so shifted for each thread that the ranking of the thread is 1 at time 0. Though 
each thread starts at rank 1 on different time of the day, Fig. [2] shows that time evolutions after 
rank 1 are on a common curve. N is the total number of threads, which is N = 795 at the time 
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of observation for Fig. [2 and a and b are positive constant parameters to be determined from the 
data. We performed a least square fit to (|33|) of rid = 117 data points shown in Fig. [2j The best 
fit for the parameter set (a, b) is (a*,b*) = (3.3425 x 10 -4 , 0.6145) {^x 2 / n d — 1.8). In particular, 
we see a rather clear behavior close to the origin that the plotted points are on a curve tangential 
to y axis, indicating xc(t) = 0(t b ) with b < 1 as in (f34"|) . Considering the simplicity of our model 
and formula, the fits seem good, suggesting a possibility of new application of fluid dynamics in 
the analysis of online rankings. 

3.3 Amazon.co.jp book sales rankings. 

We next turn to the ranking in the Amazon.co.jp online book sales. In this century of expanding 
online retail business, the economic impact of internet retails has attracted much attention, and 
there are studies using the sales rankings which appear on the webs of online booksellers such as 
Amazon.com We will study Amazon.co.jp, a Japanese counterpart of Amazon.com, which 

seems to be not studied (and easier to access for the authors). Basic structures of web pages for 
individual books are similar for Amazon.com and Amazon.co.jp; on a web page for a book there 
are the title, price and related information such as shipping, brief description of the book, the sales 
ranking of the book, customer reviews and recommendations. 

We should note that Amazon.co.jp, as well as Amazon.com, does not disclose exactly how 
it calculates rankings of books. In fact, there are observations [6j that Amazon.com defines the 
rankings for the top sales in a rather involved way. Therefore, it would be non-trivial and interesting 
if we could observe in the data behaviors similar to those of our simple model such as (133h . See [5] 
for economic implications of the ranking numbers in Amazon.co.jp. 

According to observation, Amazon.co.jp, as well as Amazon.com, updates their rankings once 
per hour, in contrast to the 2ch.net where the update procedure is instantaneous. This implies 
a limit of short time observational precision of 1 hour. On the other hand, for the long time 
observations, we have to consider a fact that the total number of books N is not constant. It is said 
that each year about 5 x 10 4 books are published in Japan, or about 5.7 books per hour. Certainly 
not all of the books are registered on Amazon.co.jp, so the increase of N per hour must be less 
than this value. Speed of ranking change decreases in the very tail side of the listing, and these 
practical changes in N will affect validity of applying (133j) to the data in the very tail regime of 
the ranking. This gives a practical limit to long time analysis. Fortunately, at ranking as far down 
as 6.5 x 10 5 , we still observe about 200 ranking change per hour, which makes an increase by 5.7 
books negligible, so we expect a chance of applicability of our theory for long time data. 

We will now summarize our results. The plotted 77 points in Fig. [3] show the result of our 
observation of a Japanese book rankings data, taken between the end of May, 2007 and mid August, 
2007. As seen in the figure, the ranking number falls very rapidly near the top position (about 200 
thousands in 5 days). The solid curve is a least square fit of these points to (j33|) . Amazon.co.jp 
announces the total number of Japanese books in their list, which is a few times 10 6 , but we suspect 
that this number includes a large number of books which are registered but never sell (so that we 
should discard in applying our theory). Therefore in addition to a and b in f|33|) we include N as a 
parameter to be fit from the data. Also, Amazon does not disclose the exact point of sales of each 
book, unlike 2ch.net where the exact jump time of a thread is recorded, so that the jump time to 
rank 1 of a book is also a parameter. The best fit for the parameter set (N, a, b) is: 

(N*,a*,b*) = (8.57 x 10 5 , 3.939 x 10" 4 , 0.6312), (vV/^d ^ 1.4 x 10 4 ). 

Incidentally, we note in Fig. [3] a small jump at about 300 hours. We suspect this as a result 
of inventory control such as unregistering books out of print. Obviously, these controls need man- 
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500000 




500 1000 1500 2000 

Fig 3: A long time sequence of data from Amazon.co.jp. The solid curve is a theoretical fit. 
Horizontal and vertical axes are the hours and ranking, respectively. 



power, so that they appear only occasionally, making it a kind of unknown time dependent external 
source for our analysis. 

All in all, we think it an impressive discovery that a simple formula as (|33p could explain the 
data for more than 2 months. Our way of extracting basic sociological parameters such as the 
Pareto exponent b from the ranking data on the web has advantages over previous methods such 
as in [3], in that because the time development of the ranking of a book is a result of sales of very 
large, O(N), number of books, the book moves on the ranking queue in a deterministic way though 
each sale is a stochastic process. The fluctuations of sales (randomness about who buys what and 
when) are suppressed through a law-of-large-numbers type mechanism [2]. By looking at the time 
development of the ranking of a single book, we are in fact looking at the total sales of the books 
on the tail side of the book. 

The theory of 'long-tail economy' says [lj that each product might sell only a little, but because 
of the overwhelming abundance in the species of the products the total sales will be of economic 
significance: It is not any specific single book but the total of books on the long-tail that matters. 
Our analysis on accumulated effect of products each with random and small sales, is particularly 
suitable in analyzing the new and rapidly expanding economic possibility of online retails, and 
moreover, is natural from the long-tail philosophy point of view. 

Among the parameters to be fit in the Pareto distribution there is an exponent b which is of 
importance in the studies of economy. For example, in the case of distribution of incomes, which is 
usually quantitatively analyzed by the Pareto distribution, small b means that a few people of high 
incomes hold most of the wealth (the so called '20-80 law' is a nickname for the Pareto distribution 
with 6 = 1), while for large b the society is more equal. In the case of ranking of online booksellers, 
large b means that there are many books (books in the 'long-tail' p] regime), each of which do not 
sell much but the total sales of which is significant, further implying strong impacts of online retails 
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to economy [3l [2], while small b favors dominance of traditional business model of 'greatest hits'. 
Our studies on the 2ch.net bulletin board and the Amazon.co.jp online bookseller both consistently 
give the Pareto exponent b ~ 0.6. Existing studies on online booksellers [31 [2] adopt the value of 
b = 1.2 and b = 1.148, respectively. These references also quote values from other studies, most of 
which satisfy b > 1. Note that [6] discovers, apparently based on extensive observations, that in 
the long-tail regime the sales are worse than in the head and intermediate regime, and gives the 
Pareto exponent b = 0.4 in the long-tail regime. Our method gives the total effect of intermediate 
and long-tails, so our value b = 0.6 could be more or less consistent with the observation of [6]. 



A Proof of equivalence of ([9]) and ([4j). 

Here we prove that, if supj fo < do, (J9j) and (jU) are equivalent, under the equations (P) © (|2|) ([3]) 
([5]), with positive constants fo, pi, i = 1,2, • • •. (The extra condition on boundedness of /, is of 
course irrelevant for the finite component cases.) 

First assume ([9]). Then with ([8]) we have v (0, t) = fo pi. On the other hand, integrating ([T|) 

i 

by y from to 1 and using (|9|) and ([3|) we have v(Q, t) Uj(0, t) = fi pi. The two equations imply (jij). 
Next assume (H|) and let 



Tf = (U 1 ,U 2 ,---); Ui(t)= [ 1 Ut (z,t)dz, 

Jo 



i = 1,2,-- - . (37) 



Then ([8]) implies 



;(0,t)=X)/i^(<)- (38) 



Integrating (pQ) by y from to 1, and using ([3]) ((4]) (j38j) . we have 

dlf 



dt 

with 

fiP 



(t) = (Alf)(t), t^O, (39) 



(ATf)i(t) = =ip V fjUjfjt) - foUi(t), i = 1,2,3,---, t^O. (40) 

The definition (0) implies J7j(0) = Pi, hence C/j(t) = Pi,t^. 0, is a solution, implying Q. Uniqueness 
of the solution to ([39]) . a linear differential equation with constant bounded coefficients, proves ©. 
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